Semi-Supervised Technical Term Tagging With Minimal User Feedback
Abstract
In this paper, we address the problem of automatically extracting technical terms from an unannotated corpus. We introduce a technology term tagger that is based on Liblinear Support Vector Machines and uses linguistic features, including part-of-speech tags and dependency structures, together with user feedback to identify technology-related terms. Our experiments show the applicability of our approach, as witnessed by acceptable precision and recall.
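The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' system. It assumes each token arrives as a hypothetical (word, POS tag, dependency relation) triple, uses a small hinge-loss linear classifier written in plain Python as a stand-in for Liblinear, and treats user feedback as extra labeled examples that trigger retraining. All function names, feature names, and hyperparameters are illustrative assumptions.

```python
from collections import defaultdict

def token_features(tokens, i):
    """Features for token i; each token is a (word, pos, dep_relation) triple (assumed format)."""
    word, pos, dep = tokens[i]
    prev_pos = tokens[i - 1][1] if i > 0 else "<S>"
    next_pos = tokens[i + 1][1] if i + 1 < len(tokens) else "</S>"
    return {
        "bias": 1.0,
        "pos=" + pos: 1.0,
        "dep=" + dep: 1.0,
        "prev_pos=" + prev_pos: 1.0,
        "next_pos=" + next_pos: 1.0,
        "is_capitalized": 1.0 if word[:1].isupper() else 0.0,
    }

def score(weights, feats):
    """Linear decision value; positive means 'technical term'."""
    return sum(weights[name] * value for name, value in feats.items())

def train(examples, epochs=50, lr=0.1, reg=0.01):
    """Hinge-loss subgradient descent (a stand-in for Liblinear).

    examples: list of (feature_dict, label) with label +1 (term) or -1 (not a term).
    """
    weights = defaultdict(float)
    for _ in range(epochs):
        for feats, label in examples:
            margin = label * score(weights, feats)
            # L2 shrinkage on the features active in this example.
            for name in feats:
                weights[name] -= lr * reg * weights[name]
            # Update only when the example violates the margin.
            if margin < 1:
                for name, value in feats.items():
                    weights[name] += lr * label * value
    return weights

def retrain_with_feedback(examples, feedback):
    """Append user-labeled (feature_dict, label) pairs and retrain from scratch."""
    return train(examples + feedback)
```

In this sketch a user correction is simply one more training pair, so a rejected candidate such as an adjective wrongly tagged as a term would be passed to `retrain_with_feedback` with label -1.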
Publication date: 2012